Target Neighbor Consistent Feature Weighting for Nearest Neighbor Classification

Authors

  • Ichiro Takeuchi
  • Masashi Sugiyama
Abstract

We consider feature selection and weighting for nearest neighbor classifiers. A technical challenge in this scenario is how to cope with the discrete update of nearest neighbors when the feature space metric is changed during the learning process. This issue, called the target neighbor change, was not properly addressed in the existing feature weighting and metric learning literature. In this paper, we propose a novel feature weighting algorithm that can exactly and efficiently keep track of the correct target neighbors via sequential quadratic programming. To the best of our knowledge, this is the first algorithm that guarantees the consistency between target neighbors and the feature space metric. We further show that the proposed algorithm can be naturally combined with regularization path tracking, allowing computationally efficient selection of the regularization parameter. We demonstrate the effectiveness of the proposed algorithm through experiments.
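To make the target neighbor change concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes that a "target neighbor" of a point is its nearest point of the same class (the usage common in the metric learning literature) and that the metric is a squared Euclidean distance with one non-negative weight per feature. The function names and the toy data are hypothetical and chosen only for illustration.

```python
def weighted_sq_dist(x, y, w):
    """Squared distance with one non-negative weight per feature (assumed metric)."""
    return sum(wk * (a - b) ** 2 for wk, a, b in zip(w, x, y))


def target_neighbor(i, X, labels, w):
    """Index of the nearest same-class point to X[i] under feature weights w."""
    candidates = [j for j in range(len(X)) if j != i and labels[j] == labels[i]]
    return min(candidates, key=lambda j: weighted_sq_dist(X[i], X[j], w))


# Toy data (made up): three points of the same class, two features.
X = [(0.0, 0.0), (1.0, 3.0), (3.0, 1.0)]
labels = [0, 0, 0]

# The same query point gets a different target neighbor once the weights change.
tn_a = target_neighbor(0, X, labels, w=(1.0, 0.1))  # feature 0 dominates
tn_b = target_neighbor(0, X, labels, w=(0.1, 1.0))  # feature 1 dominates
print(tn_a, tn_b)
```

Under the first weight vector the target neighbor of point 0 is point 1, and under the second it is point 2. The neighbor assignment is a discrete function of the weights, which is why a learning procedure that fixes the target neighbors once and then updates the weights can end up inconsistent with its own metric.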

Similar resources

An Improved K-Nearest Neighbor with Crow Search Algorithm for Feature Selection in Text Documents Classification

The Internet provides easy access to a kind of library resources. However, classification of documents from a large amount of data is still an issue and demands time and energy to find certain documents. Classification of similar documents in specific classes of data can reduce the time for searching the required data, particularly text documents. This is further facilitated by using Artificial...

A Novel Scheme for Improving Accuracy of KNN Classification Algorithm Based on the New Weighting Technique and Stepwise Feature Selection

K nearest neighbor algorithm is one of the most frequently used techniques in data mining for its integrity and performance. Though the KNN algorithm is highly effective in many cases, it has some essential deficiencies, which affects the classification accuracy of the algorithm. First, the effectiveness of the algorithm is affected by redundant and irrelevant features. Furthermore, this algori...

Weighting Unusual Feature Types

Feature weighting is known empirically to improve classification accuracy for k-nearest neighbor classifiers in tasks with irrelevant features. Many feature weighting algorithms are designed to work with symbolic features, or numeric features, or both, but cannot be applied to problems with features that do not fit these categories. This paper presents a new k-nearest neighbor feature weighting...

Weighting and selection of features

Several methods for feature selection and weighting have been implemented and tested within the similarity-based framework of classification methods. Features are excluded and ranked according to their contribution to the classification accuracy in the crossvalidation tests. Weighting factors used to compute distances are optimized using global minimization procedures or search-based methods. O...

Publication date: 2011